Knowledge Transfer Pre-training

Authors

  • Zhiyuan Tang
  • Dong Wang
  • Yiqiao Pan
  • Zhiyong Zhang
Abstract

Pre-training is crucial for learning deep neural networks. Most existing pre-training methods train simple models (e.g., restricted Boltzmann machines) and then stack them layer by layer to form the deep structure. This layer-wise pre-training has a strong theoretical foundation and broad empirical support. However, it is not easy to apply such a method to pre-train models without a clear multi-layer structure, e.g., recurrent neural networks (RNNs). This paper presents a new pre-training approach based on knowledge transfer learning. In contrast to the layer-wise approach, which trains model components incrementally, the new approach trains the entire model as a whole but with an easier objective function. This is achieved by utilizing soft targets produced by a previously trained model (the teacher model). Compared to conventional layer-wise methods, the new method does not depend on the model structure, so it can be used to pre-train very complex models. Experiments on a speech recognition task demonstrated that with this approach, complex RNNs can be well trained with a weaker deep neural network (DNN) model as the teacher. Furthermore, the new method can be combined with conventional layer-wise pre-training to deliver additional gains.
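The soft-target objective described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the temperature parameter, the toy logits, and the follow-up hard-label loss are all assumptions made for illustration. The sketch only shows the general idea of training a student model against a teacher's output distribution instead of one-hot labels.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax; the temperature is an assumed knob."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def soft_target_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy between the teacher's soft targets and the student's output."""
    soft_targets = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    return -sum(t * math.log(p) for t, p in zip(soft_targets, student_probs))

def hard_target_loss(student_logits, label):
    """Ordinary cross-entropy against a one-hot label (assumed fine-tuning step)."""
    return -math.log(softmax(student_logits)[label])

# Hypothetical logits for one frame with three classes.
teacher_logits = [2.0, 0.5, -1.0]   # from the previously trained teacher
student_logits = [0.1, 0.3, -0.2]   # from the model being pre-trained
label = 0                           # hypothetical ground-truth class

pretrain_loss = soft_target_loss(student_logits, teacher_logits)
finetune_loss = hard_target_loss(student_logits, label)
print("soft-target (pre-training) loss:", pretrain_loss)
print("hard-target (fine-tuning) loss:", finetune_loss)
```

Because the loss is computed only on the model's outputs, nothing in it refers to layers or to the internal structure of the student, which is consistent with the abstract's claim that the approach is independent of model structure.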


Similar resources

The effect of health education on the knowledge of health volunteers at the Birjand Health Center regarding healthy lifestyles

Background and Aim: The main goal of the health coordinating volunteers' program is the promotion of their knowledge and skills through an active and favorable instructional system. Holding different training courses on healthy lifestyles covering nutrition, mobility, stress management, and life skills seems necessary for health coordinating volunteers so that they could learn health life skills,...


Effect of structured training programme on the knowledge and behaviors of breast and cervical cancer screening among the female teachers in Turkey

BACKGROUND Breast cancer and cervical cancer are the most common cancers among women in the world. Many studies on the early detection of cancer have been conducted among women worldwide, but few studies have been performed in the world on female teachers regarding breast self-examination (BSE), mammography (MMG) and Pap smear test (PST). As teachers interact with students, this could play an i...


Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

Transferring the knowledge learned from large scale datasets (e.g., ImageNet) via fine-tuning offers an effective solution for domain-specific fine-grained visual categorization (FGVC) tasks (e.g., recognizing bird species or car make & model). In such scenarios, data annotation often calls for specialized domain knowledge and thus is difficult to scale. In this work, we first tackle a problem ...


Effects of pre-training using serious game technology on CPR performance – an exploratory quasi-experimental transfer study

BACKGROUND Multiplayer virtual world (MVW) technology creates opportunities to practice medical procedures and team interactions using serious game software. This study aims to explore medical students' retention of knowledge and skills as well as their proficiency gain after pre-training using an MVW with avatars for cardio-pulmonary resuscitation (CPR) team training. METHODS Three groups of ...


Progressive Neural Networks for Transfer Learning in Emotion Recognition

Many paralinguistic tasks are closely related and thus representations learned in one domain can be leveraged for another. In this paper, we investigate how knowledge can be transferred between three paralinguistic tasks: speaker, emotion, and gender recognition. Further, we extend this problem to cross-dataset tasks, asking how knowledge captured in one emotion dataset can be transferred to an...



Journal:
  • CoRR

Volume abs/1506.02256

Publication date: 2015